Session Timeout Thresholds Impact on Quality and Quantity of Extracted Sequence Rules

نویسندگان

  • Martin Drlík
  • Michal Munk
چکیده

The effort of using web usage mining methods in the area of educational data mining is to reveal the knowledge hidden in the log files of the web and database servers of contemporary virtual learning environments. By applying data mining methods to these data, interesting patterns concerning the students’ behavior can be identified. These methods help us to find the most effective structure of the e-learning courses, optimize the learning content, recommend the most suitable learning path based on student’s behavior or provide more personalized learning environment. We prepared six datasets of different quality obtained from logs of the virtual learning environment Moodle and pre-processed in different ways. We used three datasets with identified users’ sessions based on 15, 30 and 60 minute session timeout threshold and three another datasets with the same thresholds including reconstructed paths among course activities. We tried to assess the impact of different session timeout thresholds with or without paths completion on the quantity and quality of the sequence rules that contribute to the representation of the students’ behavioral patterns in virtual learning environment. The results show that the session timeout threshold has significant impact on quality and quantity of extracted sequence rules. On the contrary, it is shown that the completion of paths has neither significant impact on quantity nor quality of extracted rules.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Web log session identification with statistical language models

on statistical language modeling. Unlike standard timeout methods, which use fixed time thresholds for session identification, we use an information theoretic approach that yields more robust results for identifying session boundaries. We evaluate our new approach by learning interesting association rules from the segmented session files. We then compare the performance of our approach to three...

متن کامل

Automatic Learning of Stemming Rules for the Indonesian Language

We present a method for the automatic learning of stemming rules for the Indonesian language. The learning process uses an unlabelled corpus. In the first phase the candidate (word, stem) pairs are automatically extracted from a set of online documents. This phase uses a dictionary but is nevertheless not trivial because of morphing. In the second phase the rules are induced from the thus obtai...

متن کامل

Impact of Different Pre-Processing Tasks on Effective Identification of Users' Behavioral Patterns in Web-based Educational System

Analyzing the unique types of data that come from educational systems can help find the most effective structure of the elearning courses, optimize the learning content, recommend the most suitable learning path based on student’s behavior, or provide more personalized environment. We focus only on the processes involved in the data preparation stage of web usage mining. Our objective is to spe...

متن کامل

The Impact of Intra-Network Communications of Actors on Financial Reporting Quality by Structural Equations Technique

Actor-network theory, which is considered as a development of socio-technical structuralism school, observes reservation and stability of networks containing personal and impersonal components such as individuals, organizations, communication software and hardware, and infrastructural standards by examination of socio-technical dimensions concurrently.The goal of this research is studying the i...

متن کامل

Quality Control of Widely Used Therapeutic Recombinant Proteins by a Novel Real-Time PCR Approac

Background: Existence of bacterial host cell DNA contamination in biopharmaceutical products is a potential risk factor for patients receiving these drugs. Hence, the quantity of contamination must be controlled under the regulatory standards. Although different methods such as hybridization assays have been employed to determine DNA impurities, these methods are labor, intensive and rather exp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011